A Tutorial on Probabilistic Latent Semantic Analysis

Author

  • Liangjie Hong
Abstract

Three papers [7, 8, 9] are widely credited with establishing the techniques of Probabilistic Latent Semantic Analysis (PLSA). However, a variant of the model also appears in [11], and all of these models were originally discussed in an earlier technical report [10]. In [2], the authors extended the MLE-style estimation of PLSA to MAP-style estimation. A hierarchical extension was proposed in [6]. In [4], the authors showed the equivalence between PLSA and another popular method, non-negative matrix factorization. A high-order proof was shown in [12]. The equivalence between PLSA and LDA was shown in [5]. More recently, a new MAP estimation algorithm was proposed in [13].

Similar resources

Query expansion based on relevance feedback and latent semantic analysis

Web search engines are one of the most popular tools on the Internet which are widely-used by expert and novice users. Constructing an adequate query which represents the best specification of users’ information need to the search engine is an important concern of web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis is a novel statistical technique for the analysis of two-mode and co-occurrence data, which has applications in information retrieval and filtering, natural language processing, machine learning from text, and in related areas. Compared to standard Latent Semantic Analysis which stems from linear algebra and performs a Singular Value Decomposition of co-occu...

Query Expansion Using a Collection Dependent Probabilistic Latent Semantic Thesaurus

Many queries on collections of text documents are too short to produce informative results. Automatic query expansion is a method of adding terms to the query without interaction from the user in order to obtain more refined results. In this investigation, we examine our novel automatic query expansion method using the probabilistic latent semantic thesaurus, which is based on probabilistic lat...

On Automatic Annotation of Images with Latent Space Models

Image auto-annotation, i.e., the association of words to whole images, has attracted considerable attention. In particular, unsupervised, probabilistic latent variable models of text and image features have shown encouraging results, but their performance with respect to other approaches remains unknown. In this paper, we apply and compare two simple latent space models commonly used in text an...

Improving Probabilistic Latent Semantic Analysis with Principal Component Analysis

Probabilistic Latent Semantic Analysis (PLSA) models have been shown to provide a better model for capturing polysemy and synonymy than Latent Semantic Analysis (LSA). However, the parameters of a PLSA model are trained using the Expectation Maximization (EM) algorithm, and as a result, the trained model is dependent on the initialization values so that performance can be highly variable. In th...

Journal title:
  • CoRR

Volume: abs/1212.3900

Publication date: 2012